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Abstract 



Order-sorted feature (OSF) terms provide an adequate representation for objects as flexible 
records. They are sorted, attributed, possibly nested, structures, ordered thanks to a subsort 
ordering. Sort definitions offer the functionality of classes imposing structural constraints 
on objects. These constraints involve variable sorting and equations among feature paths, 
including self-reference. Formally, sort definitions may be seen as axioms forming an OSF 
theory. OSF theory unification is the process of normalizing an OSF term, using sort-unfolding 
to enforce structural constraints imposed on sorts by their definitions. It allows objects to 
inherit, and thus abide by, constraints from their classes. A formal system is thus obtained 
that logically models record objects with recursive class definitions accommodating multiple 
inheritance. We show that OSF theory unification is undecidable in general. However, we 
propose a set of confluent normalization rules which is complete for detecting inconsistency of 
an object with respect to an OSF theory. These rules translate into an efficient algorithm using 
structure-sharing and lazy constraint-checking. Furthermore, a subset consisting of all rules 
but one is confluent and terminating. This yields a practical complete normalization strategy, 
as well as an effective compilation scheme. 



Resume 

Les termes a traits et a sortes ordonnees (TSO-termes) fournissent une representation adequate 
pour des objets enregistrements flexibles. Ce sont des structures typees, dotees d'attributs, qui 
peuvent etre imbriquees, et qui sont ordonnees grace a un ordre de sous-sortes. Des definitions 
de sortes correspondent a des declarations de classes imposant des contraintes sur la structure 
des objets. Ces contraintes consistent en sortes de variables et des equations entre les chemins 
d'acces de traits, y compris l'autoreference. Formellement, les definitions de sortes peuvent 
etre vues comme des axiomes formant une TSO-theorie. L' unification modulo une TSO-theorie 
consiste en un processus de normalisation d'un TSO-terme, utilisant le depliage de sortes pour 
appliquer les contraintes structurelles imposees sur les sortes par leurs definitions. Ceci permet 
aux objets d'heriter les contraintes de leurs classes, et done de les satisfaire. Nous obtenons 
ainsi un systeme formel qui modelise logiquement des objets enregistrements, avec definitions 
de classes recursives, et qui accommode l'heritage multiple. Nous montrons que l'unification 
modulo une TSO-theorie est indecidable en general. Cependant, nous proposons un ensemble 
de regies de normalisation confluent qui est complet pour la detection d'objets incoherents 
par rapport a une TSO-theorie. Ces regies expriment un algorithme efficace qui utilise le 
partage de structure et la verification paresseuse des contraintes. De plus, un sous-ensemble, 
contenant toutes les regies sauf une, est confluent et Noetherien. Ceci fournit une strategic de 
normalisation complete et pratique, et un schema effectif de compilation. 
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I think it fair to say that the preoccupation with language 
among anthropologists includes a concern for expressivity 
and style as well as lexicology and syntax... Grammatical 
slips, or deviations from the idioms, can be detected by 
everyone, even the illiterate — unless the "errors" belong 
to a popular dialect, in which case they are not erroneous — 
because some things are generally considered to be wrong 
and some things cannot be said. 

Robert Darnton, The Great Cat Massacre 

1 Synopsis 

Before we develop the technical details of our method, it is important that we give the 
reader an informal motivation, assuming no background. We also relate our work to others, 
and outline the organization of the remainder of the paper. 

1.1 Motivation of problem 

In [3], ^-terms were proposed as flexible record structures for logic programming. However, 
■^-terms are of wider interest. Since they are a generalization of first-order terms, and since 
the latter are the pervasive data structures used by symbolic programming languages, whether 
based on predicate or equational logic, or pattern-directed A-calculus, the more flexible ^-terms 
offer an interesting alternative. 

The easiest way to describe a ^-term is with an example. Here is a ^-term that may be used 
to denote a generic person object: 

P : person(name id(first string, 

last 5 : string), 

age 30, 

spouse person(name id(last S), 
spouse =>• P)). 

In words: a 30 year-old person who has a name in which the first and last parts are strings, 
and whose spouse is a person sharing his or her last name, that latter person's spouse being the 
first person in question. 

This expression looks like a record structure. Like a typical record, it has field names; i.e., 
the symbols on the left of =>. We call these feature symbols. In contrast with conventional 
records, however, ^-terms can carry more information. Namely, the fields are attached to sort 
symbols (e.g., person, id, string, 30, etc.). These sorts may indifferently denote individual 
values (e.g., 30) or sets of values (e.g., person, string). In fact, values are assimilated 
to singleton-denoting sorts. Sorts are partially ordered so as to reflect set inclusion; e.g., 
employee < person means that all employees are persons. Finally, sharing of structure can be 
expressed with variables (e.g., P and 5). This sharing may be circular (e.g., P). 

Clearly, a first-order term can be viewed as a particular ^-term. Namely, considering 
only singleton sorts, a sort ordering reduced to syntactic equality, and numbers as features, 
a term f(ti, ... ,t n ) is the ^-term/(l =>- ...,«=>- t n ). In fact, ^-terms enjoy the same 
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powerful operations as first-order terms: matching (as, say, in term-rewriting systems, or ML 
function definitions) and unification (as, say, in Prolog, or equational narrowing). This makes 
them quite a more flexible data structure for symbolic programming since both operations 
take into account the partial-order on sorts and extensibility with features. Therefore, they 
can supplement first-order terms in a functional programming language or logic programming 
language [3,4]. In this manner, a form of single inheritance (matching) and multiple inheritance 
(unification) is obtained cleanly and efficiently. Pattern-directed definition of functions or 
predicates will indeed be inherited along the partial order of sorts (the sort hierarchy) thanks 
to matching or unification. 

In object-oriented programming, typically, objects do not enjoy the expressivity offered 
by ^-terms. On the other hand, they are made according to blueprints specified as class 
definitions. A class acts as a template, restricting the aspect of the objects that are its instances. 
Our intention is to conceive such a convenience for ^-terms and, in so doing, expand the 
capability of the constraining effect of classes on objects. We propose to achieve this using 
sort definitions. A sort definition associates a ^-term structure to a sort. Intuitively, one 
may then see a sort as an abbreviation of a more complex structure. Hence, a sort definition 
specifies a template that an object of this sort must abide by, whenever it uses any part of the 
structure appearing in the ^-term defining the sort. 

For example, consider the ^-term: 1 

person(name =>■ ~\~(last =>■ string), 
spouse =>■ T [spouse =>■ T, 

name =>■ ~\~(last =>■ "smith"))). 

Without sort definitions, there is no reason to expect that this structure should be incomplete, or 
inconsistent, as intended. Let us now define the sort person as an abbreviation of the structure: 

P : person(name =>■ id(first =>■ string, 

last =>■ S : string), 
spouse =>■ person(name =>■ id(last =>■ 5), 
spouse =>- P)). 

This definition of the sort person expresses the expectation whereby, whenever a person object 
has features name and spouse, these should lead to objects of sort id and person, respectively. 
Moreover, if the features first and last are present in the object indicated by name, then they 
should be of sort string. Also, if a person object had sufficient structure as to involve feature 
paths name.last and spouse. name. last, then these two paths should lead to the same object. 
And so on. 

For example, with this sort definition, the person object with last name "smith" above 
should be made to comply with the definition template by being normalized into the term: 2 

X : person[name =>■ id{last =>■ N : "smith"), 
spouse =>■ person[spouse =>■ X, 

name =>• id{last =>• N))). 

'The sort symbol T is the top of the partial order, the sort of all objects. 
2 In this example, it is assumed, of course, that " smith" <string. 
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Note that in our approach, we do not wish to enforce the explicit presence of the complete 
generic structure of a sort's definition in every object of that sort. Rather, we want to 
enforce the minimal restrictions that will guarantee that every object of a given sort denotes 
the largest possible set consistent with the sort's definition. For instance, we could use 
person(hobby =>- movie -going) without worrying about violating the template for person 
since the feature hobby is not constrained by the definition of person. 

This lazy inheritance of structural constraints from the class template into an object's 
structure is invaluable for efficiency reasons. Indeed, if all the (possibly voluminous) template 
structure of a sort were to be systematically expanded into an object of this sort that uses only 
a tiny portion of it, space and time would be wasted. More importantly, lazy inheritance is a 
way to ensure termination of consistency checking. For example, the sort definition of person 
above is recursive, as it involves the sort person in its body. Completely expanding these sorts 
into their templates would go on for ever. 

An incidental benefit of sort-unfolding in the context of a sort semilattice is what we call 
proof memoing. Namely, once the definition of a sort for a variable X has been unfolded, and 
the attached constraints proven for X, this proof is automatically and efficiently recorded by the 
expanded sort. The accumulation of proofs corresponds exactly to the greatest lower bound 
operation. Besides the evident advantage of not having to repeat computations, this memoing 
phenomenon accommodates expressions which otherwise would loop. Let us take a small 
example to illustrate this point. Lists can be specified by declaring nil and cons to be subsorts of 
the sort list and by defining for the sort cons the template ^-term cons(head =>• T , tail =>- list) . 
Now, consider the expression X : [\\X\, the circular list containing the one element 1 — i.e., 
desugared as X : cons(head =>■ 1, tail =>■ X). Verifying that X is a list, since it is the tail of a 
cons, terminates immediately on the grounds that X has already been memoized to be a cons, 
and cons < list. In contrast, the semantically equivalent Prolog program with two clauses: 
list([]) and list([H\T\) :- list(t) would make the goal list(X : [l\X]) loop. 

1 .2 Overview of our approach 

In this paper we present a formal and practical solution for the problem of checking the 
consistency of a ^-term object modulo a sort hierarchy of structural class templates. We 
formalize the problem in first-order logic: objects as OSF constraint formulae, classes as 
axioms defining an OSF theory, class inheritance as testing the satisfiability of an OSF 
constraint in a model of the OSF theory. We call this problem OSF theory unification. 

We give conditions for the existence of non-trivial models for OSF theories, and prove the 
undecidability of the OSF theory unification problem. We also show that failure of OSF theory 
unification (i.e., non-satisfiability of an OSF term modulo an OSF theory) is semi-decidable. 
We propose a system of ten normalization rules that is complete for detecting incompatibility 
of an object with respect to an OSF theory; i.e., checking non-satisfiability of a constraint 
in a model of the axioms. This system specifies the third Turing-complete calculus used in 
LIFE [2], besides the logical and the functional one. 

As a calculus, the ten-rule system enjoys an interesting property of consisting of two 
complementary rule subsets: a system of nine confluent and terminating weak rules, and 
one additional strong rule, whose addition to the other rules preserves confluence, but loses 
termination. There are two great consequences of this property: (1) it yields a complete 
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normalization strategy consisting of repeatedly normalizing a term first with the terminating 
rules, and then apply, if at all necessary, the tenth rule; and (2) it provides a compilation scheme 
for an OSF theory since all sort definitions of the theory can be normalized with respect to the 
theory itself using the weak rules. 

1 .3 Relation to other work 

Our system is unique in that it comes with a semantic foundation and constitutes the first 
proven correct and complete, practical algorithm for the problem of unfolding sort definitions 
in order-sorted feature structures. 

The problem was first already addressed in [1]. A significant difference is that the method 
was restricted to single inheritance and was non-lazy. Operationally, it amounted to a 
breadth-first expansion of all sorts and was not very practical. 

Concerning undecidability of OSF theory unification, a related, but different result was 
proven by Gert Smolka in [13]. The undecidability of our problem uses explicitly the existence 
of a model satisfying the sort definitions while this is overlooked in [13] (cf., also, Footnote 6). 

As for unfolding sort definitions, we know of two other works, both relevant to computational 
linguistics: that of Bob Carpenter and that of Martin Emele and Remi Zajac. Bob Carpenter [6] 
proposed a simple type-checking of a system of sort definitions for feature terms that are 
essentially a variation of ^-terms. However, besides being purely operational, this system 
is limited to the simple case where sort definitions specify sort constraints on features 
alone, without feature compositions and, more importantly, without shared variables imposing 
coreference constraints on feature paths. On the other hand, his formalism handles partial 
features, while what we present works with total features. As it turns out, our system can be 
made to handle partial features with the addition of one simple decidable rule whose effect is 
to narrow the sort of a variable to intersect a feature's domain when that feature is applied to it. 
Therefore, the system described in [6] is a special case of what we present here. In the recent 
book [7], Chapter 15 deals with "recursive type constraint systems" extending that of [1] to 
be of the kind we study here. He gives a complete resolution method similar to Horn clause 
resolution. That method differs from ours in that it is not lazy. 

The work of Emele and Zajac on typed unification grammars [10] is actually quite close to 
what we report here. Their work is an elaboration of [1], with the assumption that features 
are partial. Their main contribution has been the study of clever algorithms to carry out type 
unfolding efficiently. In [9], Martin Emele describes an implementation that shares many 
insights with the method that we describe here. In particular, he uses structure- sharing to avoid 
much copying overhead, and whenever copying must be done, it is done such that no redundant 
copying is performed. However, his technique differs from ours, in that when copying is 
done, all the defined features of a sort are brought into the formula where it appears. Most 
importantly, Emele 's algorithm is not explained in formal terms, let alone proven correct. No 
semantics is provided, and no clear delineation is made, as our rules do, between a maximal 
decidable subset of cases and the complete normalization. 

The functional programming community has been using variations on, and generalizations 
of, an extensible record formalism pioneered by Luca Cardelli [5] and used to endow 
polymorphically typed languages of the ML family with a form of multiple inheritance [14, 12]. 
Records are viewed as partial functions from field label symbols to values. Record types are 
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defined similarly as partial functions from labels to types. What corresponds to unification 
in our formalism is rendered there as record concatenation. In contrast to our (possibly 
circular) use of logical variables and unification, coreference constraints are not supported, 
and self -reference is handled using a special fix-point functional abstraction. Subtyping in the 
Cardelli style of records is checked using static inference rules that are essentially performing 
the kind of verification done by Carpenter's system [6], but made more complicated by the 
presence of polymorphic function types. It is hence very hard to compare that trend of work 
and ours because of these differences in the nature, restriction, and use of records. 

1 .4 Organization of paper 

Section 2 presents our formalization of OSF theories and recounts essential facts about 
them. Section 3, the crux of the paper, presents the OSF normalization system and its formal 
properties. We have adjoined an appendix: Section A gives a detailed example of OSF 
theory normalization, and Section B reintroduces the necessary OSF formalism concepts and 
terminology that we need. 

2 OSF Theories 
2.1 OSF Formalism 

Let us first recall very briefly a few OSF formalism notions and notation. 3 We shall use a 
set of sort symbols S, equipped with partial order < and meet operation A, together with a set 
T of feature symbols. These two sets define an OSF signature and generate a set of OSF terms 
with the following context-free rule: 

t ::= X : s(l\ t, . . . ,l„ i) 

where X is a variable from a set V, s is a sort in S, and If 6 J 7 , n > 0. The variable X is called 
the term's root variable, referred to as Root(t) for such a term t. The sort s is called the term's 
root sort, or its principal sort. We shall refer to the sort of a variable V occurring in a ^-term t 
as Sort t ( V), or simply Sort[v) if the term is clear from the context. 

An OSF constraint is one of (1) X : s, (2) X = X', or (3) XI = X', where X and X' are 
variables in V, s is a sort in S, and I is a feature in T . An OSF clause is a set of OSF constraints 
(interpreted as their conjunction). 

Any OSF term t is equivalently expressible as an OSF clause, denoted c/)(t), called its 
dissolved form. We shall often confuse an OSF term t for its dissolved form, writing t where 
we mean (j>{t). We will use a shorthand notation to express that a variable X is constrained by 
an OSF term t. Namely, we denote by C t [X] the formula X = Root(t) & and by Cf[X] 
the formula 3Var(t) C t [X]. 

Syntactically consistent OSF terms are said to be in normal form, and called ^-terms. They 
comprise a set called It is natural to extend < and A from the sort signature to the set 1^, 
where they realize matching and unification, respectively. Unification of OSF terms is done 
thanks to a normalization procedure. The rules to normalize OSF terms are given in Figure 1 . 

3 The reader who is not familiar with the OSF formalism as defined in [4] will find sufficient details in appendix 
Section B. Please refer there if, although we tried to avoid it, a concept is used without having been previously 
defined. 
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Sort Intersection: 




6 & X ■ s & X ■ s' 


(1) 






4>&X:s As' 




Inconsistent Sort: 




<j)&.X\ _L 


(2) 






X : _L 




Variable Elimination: 




<j) & X = z' 


(3) 


if X / X' and X £ Var((j>) 




(j>[X'/X] &X = X' 




Feature Decomposition: 




(/) & XI = X' & XI = X" 


(4) 




(/) & XI = X' & X' = X" 



Figure 1 : OSF Clause Normalization Rules 
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2.2 Sort Definitions 

As explained in the previous section, we may view a class template as a ^-term. Hence, to 
define a sort s as a class is to associate to this sort a ^-term whose root sort is s. Informally, an 
OSF theory is a set of sort definitions, each of which is a ^-term whose root sort is the name 
of the class defined by that sort. 

Formally, an OSF theory is a function 0 : S i-> & such that Sort(Root(&(s))) = s for all 
s e S and @(t) = T, & (±) = _L. The OSF theory 0 = l s which is the identity on S is 
called the empty OSF theory. 

An OSF theory 0 is order-consistent if it is monotonic; i.e., if Vs, s' 6 S, s < s' =>■ 0[s) < 
0{s'). Recall that < is defined on ^-terms (see Definition 3 on Page 22) extending the 
ordering on sorts. 

We shall always assume the OSF theory 0 to be order-consistent. By setting 0(s) = 
/\s<s' ®{ s ') if different from _L, it is easily possible to normalize a non order-consistent theory 
into an equivalent order-consistent one, if it exists. 

Clearly, an OSF algebra is a logical first-order structure A interpreting sort symbols as 
unary predicates, i.e., sets, and feature symbols as unary functions, and satisfying the axioms 
specified by the sort hierarchy. Namely, for all sorts s, s', s" such that s A s' = s", the following 
axiom is valid in A: 

Axiom^,-,,^ : VX (X : s & X : s' -> X : s"). 

The name OSF theory is justified from the fact that the function 0 specifies a system of 
axioms; i.e., for each s £ S, the axiom: 

Axiom [@{s)] : VX (X : s <-» C| (s) (x)) 

expressing that an element in the sort s necessarily satisfies the constraints attached to s (the 
constraints coming from the dissolved ^-term assigned to s by 0). Note that 0 (s) contains the 
constraint Root(0(s)) : s. Thus, the equivalence (<->) inAxiom^^ is, in fact, an implication 

(-)■ 

The class of all (9 -OSF algebras is the class of all OSF algebras such that s A = 
[[©(s)]]" 4 . Thus, 0 specifies a first-order theory, namely through the system of all the axioms 
Ax/om[ vAs ./ =s »] and Ax/om[@( s ,)]. The notion of ©-satisfiability refers to satisfiability in a (9-OSF 
algebra; i.e., in a logical first-order structure where the axioms above hold. 

We will see next that such a structure actually exists (under the overall assumption that 0 is 
order-consistent). We first define the OSF algebra l^o of possibly infinite OSF graphs. 

An OSF graph g = {V,E) consists of nodes denoted by mutually distinct variables in V, 
i.e., V C V, and arcs between them, i.e., E C V x V. It has a distinguished node, its root, 
from which all its other nodes are reachable. All nodes and arcs of an OSF graph are labeled. 
Nodes are labeled with non-bottom sorts and arcs are labeled with feature symbols such that 
the same feature may not be attributed to two distinct arcs coming from the same node. 

The set of all OSF graphs forms an OSF algebra: 
• the OSF graph denotation of a sort s is the set of all graphs whose root sort is equal to or 

less than s; 
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• applying the feature I to a graph g rooted in X is the maximal subgraph of g rooted in X' 
if g has an arc labeled I between nodes X and X'\ otherwise, it is a one-node arcless graph 
whose node is a new distinct variable X^ g labeled with T. 

We next define the (possibly infinite) OSF clauses Unfold[(/)) obtained from an OSF 
clause (j> by unfolding all sort definitions. Formally, Unfold{(/)) = U«>o Unfold n ((/)) , where 
Unfoldo((/)) = (j> and: 

Unfoldn+i(4>) = Unfoldn(4>) U {C @{s) [X] \ X:s £ Unfoldjjt)} . 

We assume that the variables in the OSF constraints added to Unfold n ((/)) , Var(&(s)) are new 
for each unfolded sort constraint X : s. 

We define two formulae to be 0 -equivalent if they are equivalent modulo the axioms 
specified by 0 and the sort hierarchy and modulo existential quantification of variables in only 
either of the formulae. Thus, cj> and Unfoldi{(/)) , and even Unfold((/)), are ©-equivalent. The 
next lemma compares satisfiability of cj> and Unfold((/)) in different structures. 

Lemma 1 An OSF clause (j> is 0-satisfiable if and only if Unfold((/)) is satisfiable. 

Proof: Every 0-OSF algebra where <j> is satisfiable is in particular an OSF algebra where Unfold((f) 
is satisfiable. Vice versa, the domain of an OSF algebra where Unfold(<fr) is satisfiable can be 
"trimmed down" to the domain of a 0-OSF algebra (by including only elements which are values of 
the valuations which make Unfold((f) hold true) such that Axiomy@^ holds for every sort s which 
occurs in Unfold((j>), and <j> is satisfiable. Since 0 is order-consistent, the interpretation of the sorts 
can be chosen as the restriction of the old interpretation to the new domain. | 

Definition 1 (Solved OSF Clauses) A (possibly infinite) OSF clause <j> is called solved if 
for every variable X, (f> contains: 

• at most one sort constraint of the form X : s, with _L < s; and, 

• at most one feature constraint of the form X.l = X' for each I; 

• ifX = X' G (j>, then X does not appear in any other OSF constraint in (j>. 

Lemma 2 A (possibly infinite) OSF clause (f> in solved form is satisfiable in l^o, the OSF 
algebra of possibly infinite OSF graphs. 

Proof: Let X be a variable in cj> where X is not on the left side of the symbol = anywhere in cj>. 
We define the valuation a on X as the graph (V, E) with the root node X, where V = \J„ >0 V„, 
E = U>o £ »> y o = {X}, E 0 = 0, V„ +1 = V„ U {Z | Y.i = Z e <A for some Y e V„}, E~ +1 = 
E„ U {(Y, Z) | Y.i = Z e $ for some Y <E V„}. A node Y is labeled by s if Y : s <E 4> for some s <ES, 
and by T otherwise. An arc (Y, Z) is labeled by i if Y.i = Zef 

If X = X' e cj>, then we set a(X) = a(X'). Clearly, every OSF constraint of <j> holds in W 0 under the 
valuation a. | 

Definition 2 (0-SOlved OSF Clauses) An OSF clause § is called 0 -solved if the OSF clause 
Unfoldi[(/)), obtained by unfolding all sort definitions once, can be normalized into a solved 
form which contains (/>, and no other constraints whose variables are those from (/>. 
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That is, if the solved form contains X : s, then either X : s 6 (j> or X ^ Var{(/)). Similarly, if it 
contains Y = X, then either Y = X£(/)orY^ Var((f) ; and if it contains X.I == Y, then either 
XI = Y G <t>ovY £ Var((£). 

Thus, the OSF clause cj> is ©-solved if the OSF clause: 

Unfold^) = (j>U |J {C @{s) [X]} 

X:sect> 

can be transformed, by applications of Rule 4, into an OSF constraint (f>' of the form 
^' = ^U^iU^2 where (j>\ contains only equalities of the form Y = X where X 6 Var{(/)) and 
y ^ Var(^>) and (j>i is an OSF constraint in solved form whose variables are new for (j>; i.e., 
Var((j)) |~l Var((j>2) = 0. 

The OSF theory 0 is well-formed if, for every s £ S, the dissolved ^A-term 0(s) is in 
©-solved form. From now on we are interested only in well-formed (and order-consistent) 
OSF theories. 

We introduce next the OSF algebra The domain of <^@, and the interpretation of the 
features, are the ones of l^o- If s 6 S is a sort, then: 

s*® ={ge D*° | <F 0 , a |= Unfold{X : s), a(x) = g}. 

In the special case of the empty theory, is the OSF graph algebra l^o- 

As in the case of OSF unification, i.e., of satisfiability of OSF clauses in OSF algebras, 
it is sufficient to consider ©-satisfiability in one particular 0-OSF algebra, here This 
characterizes ty® as canonical 0-OSF algebra (meaning: any (9-satisfiable OSF clause is 
satisfiable in &@). It follows from the fact that one can easily construct a homomorphism from 
any 0 -algebra into (and, thus, is weakly final (cf., [4]) in the category of all 0-OSF 
algebras). 

Proposition 1 Given a well-formed order-consistent OSF theory 0, a 0-solved OSF clause 
is satisfiable in In particular, is a 0-OSF algebra, i.e., a model of the axioms specified 
by the sort hierarchy (S, <, A) and the OSF theory 0. 

Proof: Since, for each sort s £ S, 0(s) is ©-solved, Unfold„(<fr) is ©-solved, for all n. In particular, 
for all n Unfold„((j>), and hence also Unfold{(f), is ©-equivalent to an OSF clause in solved form. 
Thus, according to Lemma 2, Unfold(<fr) is satisfiable in Wq, the OSF algebra of possibly infinite 
OSF graphs. Say, Unfold(cj)) holds under the valuation a. Since all sort definitions in Unfold(cj)) 
are unfolded, each graph g rooted in a node labeled by a sort s lies in the -denotation of s; i.e., 
g £ s* 8 (. . . C Thus, a is in particular a W@ -valuation. That is, Unfold(<fr) and, hence cj> C cj>' , 
are satisfiable in W @ . | 

3 OSF Theory Unification 

We next investigate the denotational and operational semantics of the inheritance mechanism 
from a class template structure into an object instance. We call this mechanism OSF Theory 
Unification since it is the solving of OSF clauses in the presence of an OSF theory. This 
is a generalization of OSF unification, the solving of OSF clauses in the empty theory (cf, 
Figure 1). 
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Formally, OSF Theory Unification is the procedure which ©-solves an OSF clause <j>; i.e., it 
transforms <j> into a ©-equivalent OSF clause (j)' which is either _L or in ©-solved form (and, 
in this case, exhibits it). 

We will show that such a procedure exists that transforms cj> successively until either _L or a 
©-solved form is obtained. If <j> is ©-equivalent to _L, then _L is reachable in a finite number of 
steps. Generally, however, there exists no such procedure that is always terminating. Indeed, 
if such a procedure existed, then according to Proposition 1, there would be an algorithm 
deciding whether an OSF constraint cj> is satisfiable in the 0-OSF algebra <^@. This, however, 
is impossible as Theorem 1 will show. 

Next, we will informally describe and motivate the effect of each rule. Before doing that 
we need to define some additional notation. We will follow strict naming conventions for 
variables in order to identify them. We shall use X's for variables appearing in a formula being 
normalized, and call these global or formula variables. We shall use Fs for variables in the 
theory, and call these local or theory variables. 

The theory variables appearing in a sort definition are all local to this definition 

alone. Thus, without loss of generality, we shall assume distinct names for all variables 
across sort definitions. More precisely, s ^ s' =>- Var(&(s)) n Var(&(s')) = 0. Let 
Var(@) = \J seS Var(0(s)) denote the set of all theory variables. 

We shall use Z's for new global variables introduced into a formula being normalized. 
Finally, the theory variable at the root of @(s), the definition of a sort s, will be identified as 
Y s . We will denote by Roots(&) the set of all root theory variables. Local and global variables 
are always assumed disjoint. 

Two theory variables Y and Y' are said to be path-compatible (noted Y JJ. Y') if they lie on 
the same occurrence path in the definitions where they occur. Formally, Y JJ. Y' if and only if 
Occ(Y) n Occ(Y') jfc 0. 4 

We will denote by Iq(y) the theory variable Y', if it exists, such that l(Y) = Y' in some 
sort definition 

Note that Roots(&) is in bijection with S. In particular, the operation A on S can be defined 
on Roots(0) as Y s A Y s i — Y s /^ s i. In fact, the operation A extends homomorphically to all 
Var(©) by defining it inductively as follows: 



Yi AY 2 



JW if Yi = Y s and Y 2 = Y s >; 

£@(Y[ A Y' 2 ) if Yx 4J. Y 2 and Y t = 4>(^), for f = 1,2; 

Y± otherwise. 



This operation is well-defined (1) because 0 is order-consistent, and (2) thanks to the fact that 
path-compatible variables must lie at the end of a same feature path from their definitions' 
roots and the meet (A) is defined on root variables. 

The normalization rules that perform OSF theory unification are given in Figures 2, 3, and 4 
and are called OSF theory normalization rules. 5 The rules in Figures 2 and 3 alone are called 
the weak (OSF theory) normalization rules. As for plain OSF normalization, each rule 
specifies a transformation of the pattern in the numerator into that of the denominator. While 
the rules of Figure 1 transform OSF clauses, the new rules transform contexted OSF clauses. 



See Section B for a definition of Occ. 



5 A full example of sort-unfolding using these rules is detailed in appendix Section A. 
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(0) 



(1) 



(2) 



(4) 



Frame Allocation: 

r \- X:s&cl> 

r U {{X\Y S }} h X : s & (j> 
Sort Intersection: 

r u {{x\y s /} uf} \- x-.s &x-.s' 

r U U F} h X : s A s' & </> 

Inconsistent Sort: 

r u{my±}uF}h^ 



0 hi 



Variable Elimination: 



ifX\Y s , £ F, for any s' £ 5, 
for all F £ T 



r \-x = x' &(!> 

(3) ifX/X'andX£ Var(r)U Var[4>) 

r[x'/x] h- z = z' & ^[x'/x] 



Feature Decomposition: 

r h x.i = x' & xi = x" &(/) 

r^xi = x' &x' = x" & $ 



Figure 2: Weak OSF Theory Normalization Rules — Empty Theory 
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Feature Inheritance: 



rutauf} hx.i = x'&(/) 

(5) r if 1 @ {Y) = ¥' andXV £ F 

r U Uf)h X.I = X' & X' : Sort(Y') & <j> 



(6) 



Frame Merging: 

r u{W}uf I {x\iV}uf'}^ 
r U{{^\W}ufuf'} h</> 

Frame Reduction: 

r \j{{x\y,x\y'}uf\ \-<i> 

(7) >- ifF^y 

r u {AFAJ"}uf}h,i 



(8) 



Theory Coreference: 

r \J {{X\Y,X'\Y} u f} \-4> 

r\j{{X\Y}UF} hX = X'&</> 



Figure 3: Weak OSF Theory Normalization Rules — Non-Empty Theory 



Theory Feature Closure: 

r \~ <j) if X\Y £ F andX\y' £ F' for some F, F' £ T, 

(9) and both 1 @ {Y), 1 @ {Y') exist 

r h X.l = Z & (/) (Z is a new variable) 



Figure 4: Strong OSF Theory Normalization Rule 
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A contexted clause is a formula of the form r h <j> where <j> is an OSF clause and r, called the 
context, is a set of frames. A frame is a set of pairs of variables X\Y (read "X stands for F") 
where X G Var{(/)) and Y G Vfar(@). We write simply (j> for 0 h 

The rules proceed to normalize a formula from an originally empty context, creating at most 
one frame per formula variable. These rules maintain frames so that there is exactly one root 
theory variable per frame at any moment. The global variable in a frame that stands for the 
root local variable is called the frame's principal variable. Intuitively, one may think of a 
context as a set of activation frames, each being a local environment for a variable occurring 
in the formula (j), the pairs indicating what global variables stand for what local variables. 
Alternatively, one can think of a frame as the materialization of an object instance. Thus, 
the rules must ensure that a global variable is eventually principal in at most one frame. In 
addition, note that the rules will materialize only what is necessary to ensure that the instance 
is consistent with the class definition. 

Rule (0) simply spawns a new frame for a global variable if none exists for it yet in the current 
context. This is akin to creating an instance in object-oriented programming. Rules (l)-(4) 
do exactly the same work as Rules (l)-(4) in Figure 1. The only difference is that they keep 
track of the sort information in the context r using root theory variables. Rule (5) ensures that 
whenever a feature is used in the formula it fits the constraints, if any, imposed on it by the 
theory. Rule (6) recognizes that a global variable is principal in two frames and merges them. 
This case arises from variable elimination and is that of two originally distinct global variables 
that are later made to corefer. Rule (7) determines that the same global variable stands for two 
distinct path-compatible local variables within the same frame. Therefore, the global variable 
must stand for the common lower bound of these two local variables. Rule (8) enforces an 
equation of paths as prescribed by the theory when it finds that two distinct global variables 
stand for the same local variable in the same frame. 

Rule (9) looks more complex than Rules (0)-(8). In fact, it simply completes the enforcing of 
functionality of features. Functionality of a feature I means that if X = X' then l{X) =l(X'). 
Rule (4) enforces feature functionality in the formula alone as I is applied at two occurrences 
of the same variable in the formula. Rule (5) does the same for the case when one occurrence 
is in the formula and the other is in the theory on the corresponding local variable. The only 
case left is when it is found that, even though a global variable is not being applied a feature I 
explicitly in the formula, it still may stand for two theory variables both being applied that very 
feature I. We need to check whether the induced equality between the two theory variables 
leads to an inconsistency. Therefore, a new global variable must be created and injected into 
the formula as the result of applying I to that global variable. This is done by an application 
of Rule (9). After that, Rule (5) will do the right thing, bridging the gap between the two local 
variables using this new global variable. In fact, it guarantees the transitivity of congruence 
of feature path equations as per the theory. It is this rule that may make the normalization 
algorithm diverge on consistent formulae as there is, in general, no way to predict how deep 
along a feature path an inconsistency might arise. This is indeed confirmed by the following 
fact. 6 



5 A related, but different result can be found in [13] where well-formedness, order-consistency and the existence 
of one generic model of an OSF theory (there called a system a recursive sort equations) are not considered. In fact, 
without Proposition 1, we do not know whether there is any OSF constraint which is satisfiable modulo a system 
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Theorem 1 Given a well-formed order-consistent OSF theory 0, the problem of the satisfia- 
bility of an OSF constraint in the 0-OSF algebra is generally undecidable. 

Proof: We show that a complete OSF Theory Unification algorithm is also a decision procedure for 
the word problem for Thue systems of equations on strings [11]. Consider a finite alphabet £ and 
a finite set E C S* x S* of equations of words on S. The word problem that consists in deciding 
whether two words w\ and wi in S* are equal modulo the equations in E can be encoded as the 
following OSF theory unification problem. Let us take for sorts <S = {T, s, 0, 1, _L} with 0 < s, 
1 < s, and 0 A 1 = _L, and for the features T = S. Let us define 0 such that 0(s) is the V'-term 
whose variables are all sorted with s and such that to each equation u = v in E corresponds one of 
two occurrence paths from the root that meet in a common variable at their end. 

Let us take an example to explicate this encoding. Consider the system of equations E = {be = 
ed, ae = b,bd= de}. It is encoded as an OSF theory over the sorts of S above and the set of features 
T = {a, b, c, d, e}. The sort definitions are: 

0(s) = s{b =^ Yl : s{c => Y2 : s, d =^ Y3 : s), 
e=> s{d=> Y2), 
a s(e Yl), 
d=>s{e => 73)). 

As for 0(0) and 0(1), they both inherit the exact same structure as 0 (s) except for the root sort since 
Sort(Root(0(O))) = 0, and Sort{Root{0{\))) = 1. Clearly, 0 is a well-formed and order-consistent 
OSF theory. 

Now, to decide whether an equality w\ = wi holds modulo the equations, it suffices to normalize the 
OSF term consisting of just two non-coreferring occurrence paths w\ and W2, and whose root sort is 
s and all other sorts are T except for the tips of the two paths which are 0 and 1 . If the normalization 
algorithm is complete, then it will necessarily make the two paths corefer (and thus end with a sort 
clash, i.e., normalize the dissolved V'-term to the equivalent OSF clause _L) if and only if the equality 
w\ = W2 holds. Otherwise, i.e., if and only if the equality does not hold, it will normalize the 
dissolved ^-term to an equivalent 0-solved OSF clause and, thus, exhibit its ©-satisfiability. 

For example, to decide whether abc = de modulo the above equations, we need to check whether the 
■0-term: 

s(a => T{b => T(c => 0)), 
d^T{e^ 1)) 

(i.e., the OSF clause obtained by dissolving it) is not satisfiable modulo the OSF theory 0 given 
above. | 

Lemma 3 If (j> is transformed into r \- </>' by the (strong) OSF theory normalization rules, 
then (j> is 0 -equivalent to (j>'. 

Proof: For a contexted formula T \- cj>, let us define the OSF clause: 

[r h 4>] = 4> u \J{c @{s) [X] & Yi = Xi & . . . & Y n = x„} 

of sort definitions. Thus, the result in [13] is on a test of satisfiability in all 0-OSF algebras, and its proof has to 
provide the construction of a particular one. 
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where the big union is taken over the frames {X\F S , Xi\Yi, . . . , X„\Y„} £ T. 

The variables in [X] & Y 1 = X 1 & . . . & Y n = X„ are taken new for each of these frames. 

Clearly, cj> is ©-equivalent to [T h <fr]. 

If r h cj) is transformed to J" h then |T h is ©-equivalent to [T" h This can be verified 
by inspection of each of the OSF theory normalization rules. For each application by one of these, 
we will give corresponding ©-equivalence transformations on [r h <j>]. These will either consist of 
adding [X] (again, obtained by naming its variables apart), or of applications of one of the rules 
of Figure 1. Since these are all equivalence transformations, [T h <fr] is equivalent, and thus also 
©-equivalent, to [/" h </>']. 

Each application of Rule (0) of Figure 2 adds a frame {X\Ts} to the context of T h cj>. The 
corresponding transformation on the OSF clause [T h cj)] consists of adding the OSF clause [X]. 
One hereby obtains a ©-equivalent OSF clause. 

Clearly, each step by application of Rule (;) of Figure 2 to r h <j> corresponds to one step of 
application of Rule (/) of Figure 1 to |T h <f>], for i = 1 , . . . , 4. In case of Rule (1), if s A s 1 is a strict 
subsort of s', then, in addition, C 0 ( sA5 /)[X] has to be added. 

An application of Rule (5) of Figure 3 to T h cj> corresponds to one variable elimination step, 
followed by one step of application of Rule (4) of Figure 1 (the feature constraint Y.l = Y' is part of 
p), followed by another variable elimination step to [T h (/>]. 

An application of Rule (6) of Figure 3 to T h cj> yielding J" h cj>' corresponds to two variable 
elimination steps, followed by one step of application of Rule (1) of Figure 1 to |T h cj)]. We add the 
OSF clause C@( sAs ')[X], hereby obtaining the ©-equivalent OSF clause |T' h <fr']. 

An application of Rule (7) of Figure 3 corresponds to one variable elimination step, followed by one 
step of application of Rule (4) of Figure 1 (the feature constraints X' .1 = X and X' .1 = Y are part of 
the derived OSF clause). 

An application of Rule (8) of Figure 3 corresponds to several variable elimination steps. 

Finally, an application of Rule (9) in Figure 4 adds a feature constraint X.l = Z with a new variable 
Z. Clearly, [r h cj)] is ©-equivalent to [P h cj) & X.l = Z]. | 

Theorem 2 If (j> is transformed into the non-bottom normal form JV l~ <I>N by the (strong) OSF 
theory normalization rules, then (f> N is an OSF clause in 0 -solved form which is 0 -equivalent 
to (f). 

In particular, because we assume 0 to be well-formed and order-consistent, (j> is, then, 
(9-satisnable (e.g., in <^@). Of course, if (j> is transformed into fa = _L, then (j> is not 
(9-satisfiable. 

Proof: It is easy to see that, if T N h <fr N is in non-bottom normal form, then [r N h <fr N ] is in 
solved form. Namely, otherwise one could apply an OSF clause normalization rule from Figure 1 
to [Tjv H 4>n\, this application could, in turn, be simulated by an application of an OSF theory 
normalization rule from Figure 2-4. But this means exactly that <f>N is in ©-solved form. % 

Theorem 3 The weak OSF theory normalization rules are terminating and confluent ( modulo 
a renaming of formula variables). 
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Proof: The number of times a sort definition is unfolded (via Rule (0)) is limited by the number of 
sort and of feature constraints in the OSF clause to be normalized. Let is the OSF clause obtained 
from cj> by doing all these unfoldings, i.e., by adding the OSF clauses C@( s ) [X], obtained by dissolving 
the corresponding ^-terms &(s) and naming its variables apart. Then, using the correspondence 
from the proof of Theorem 2, each OSF theory weak normalization step on <j> can be simulated by an 
OSF clause normalization step on Then, Theorem 7 yields the statement. | 

Theorem 4 The weak OSF theory normalization rules normalize a formula in almost linear 
time (in the size of the formula). 

Proof: We use the simulation of OSF theory normalization by plain OSF clause normalization from 
the preceding proof and the fact that OSF clause normalization is almost linear (the size of each 
unfolded sort definition is assumed constant). | 

Theorem 5 If terminating, the ( strong) OSF theory normalization rules are confluent ( modulo 
a renaming of formula variables). 

Proof: If the (strong) OSF theory normalization is terminating, Rule (9) is applied only a finite 
number of times. Each time, it adds a feature constraint X.l = Z with a new variable Z. Let a be the 
OSF clause of all these feature constraints. Then, $ & a is transformed into the non-bottom normal 
form r N h cj> N by the weak OSF theory normalization rules only, and we can apply Theorem 3. | 

Theorem 6 (Completeness) If <j> is not 6-satisfiable then <j> is reduced to _L by the OSF 

theory normalization rules. 

Proof: Using Lemma 1, if <j> is not 0-satisfiable, then Unfold{(f) is not satisfiable. 

We use the fact (which is a consequence of the compactness theorem [8]) that, given a first-order 
theory T and a set W of open first-order formulae, T U (3) W has a model if and only if, for every finite 
subset F of W, T U (3)Fhas a model. Here, T is given by the axioms Axiom[ sAs i =s ii] and Axiom^Q^ 
specifying the sort hierarchy and the OSF theory. 

Thus, if a possibly infinite OSF clause is not satisfiable, then there exists a finite subset of it that is 
not satisfiable. Now, if $ is not 0-satisfiable, then there exists an index n such that Unfold„((j)) is 
not satisfiable. Let (/>' be the minimal non-satisfiable extension of <j> with sort-unfoldings, i.e., with 
additions of OSF clauses of the form C@( jAj ')[X]. 

According to Theorem 7, the finite OSF clause is reduced to _L using the OSF clause normalization 
rules (l)-(4) of Figure 1. Now, every OSF clause normalization step can be simulated by an OSF 
theory normalization step, under the correspondence described in the proof of Theorem 2. The only 
difficulty is the application of the feature decomposition rule on two feature constraints which both 
come from sort unfoldings, i.e., from added OSF clauses of the form <fr(0(s)). In this case, the 
applicability of Rule (9) has to be shown. But if follows from the fact (Theorem 3) that the weak OSF 
theory normalization are terminating. That is, after finitely many applications of Rules (0) to (8), 
none of them is applicable, and, thus, Rule (9) is. | 

We have divided the normalization processes into two phases. The first phase, consisting 
of the weak normalization rules, is guaranteed to terminate in almost linear time. If the first 
phase ends with the clause still not in normal form then the second phase, one application 
of the strong normalization rule, is performed. From these two phases we derive a complete 
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normalization strategy. Namely, the repeated application of phase one followed by phase two. 
Note that if the process terminates, it terminates in phase one. 

The fact that it is only Rule (9) that leads to undecidability gives us the ability to explore 
what makes certain theories and queries non-terminating. For instance, a loose criterion for 
a theory that guarantees that the normalization of all queries will terminate is that no two 
variables have the same feature symbols. This is clear by looking at Rule (9)'s side conditions. 
It is also clear that more complex, yet decidable, analysis can provide programmers using this 
system with this guarantee. 

Another benefit of the separation is that the terminating rules can be used to "compile" a 
theory by using a partial evaluation technique. Namely, each sort definition can be normalized 
with respect to the theory using the terminating rules only. 

4 Conclusion 

We have presented a formal system of record objects with recursive class definitions 
accommodating multiple inheritance, and equational constraints among feature paths, including 
self-reference. Although the problem of normalizing an object to fit class templates is 
undecidable in general, we have proposed a complete and efficient set of rules to perform this 
normalization whenever it may be done. 

An interesting property of this OSF theory unification process is that it consists of a 
terminating set of rules and an additional one which makes it complete. This property can be 
used to explore the exact situations when the full set of rules will be guaranteed to terminate. 

Appendix 

A A Detailed Example 

Let us take S = {T, s, si, s 2 , S3, _L} ordered minimally such that s 1 A s 2 = S3 and define 0 
as: 



The path-compatibility relation is given by Y Sl JJ. Y Sl , Y\ JJ. 73, Y2 JJ- Y3, their symmetric 
pairs, as well as all reflexive pairs. Therefore, the A operation is given by Y Sl A Y Sl = Y S} , as 
well as yielding the lesser element of all comparable pairs, and giving Y± otherwise. 

Unifying the two ^-terms t\ = s\{t\ =>- s) and t2 = £2(^2 => s) modulo the empty theory 
yields the ^-term (up to variable renaming): 

h A$ h = Sl{l\ => S,l2 => s). 

However, with respect to the theory 0 above, it yields the ^-term (up to variable renaming): 
t 3 = h A© h = si(l\ X : s(l=> s),l 2 X) 
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e{s2) 
e{s3) 
0{s) 



Y Sl -.s^lx 7i : j) 
Y S1 : s 2 {l 2 ^Y 2 :s) 



Y S3 : s 3 {li => Y 3 : s(l Y A : s),l 2 Y 3 ) 
Y s :s(l^Y 5 :s). 
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as illustrated by the following reduction trace. 7 



7 In the derivation sequence that follows, the parts of a contexted formula that make up the redex of the rule to 
apply next are highlighted by overshadowing . 
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From empty context and initial formula: 

0 

h Xf.si & Xili=X[ & X 2 :s 2 & X 2 .£ 2 = X 2 & Xi=Xi 
Frame Allocation [Rule (0)] yields: 
{Xi\Y Sl } 

h Xi:si & X^i^Xj & X 2 :s 2 & X 1 .l 1 = X' 1 & Xi=X 2 
Feature Inheritance [Rule (5)] yields: 
{X,\Y Sl ,X[\Y,} 

h Xi:si & Ijij^Xj & Xjts & X 2 : s 2 & X 2 .£ 2 = X 2 & X = X 2 
Frame Allocation [Rule (0)] yields: 

{Mi^Wi}, {x[\y s } 

h Xi:*! & Xxix^Xj & XJ:« & X 2 : s 2 & X 2 .£ 2 = X 2 & Xi = X 2 
Frame Allocation [Rule (0)] yields: 

{xA^.x'Ar!}, {x[\y s }, {x 2 \y Si } 

h Xi:si & Xi.li^Xj & X[:s & X 2 : s 2 & X 2 .£ 2 = X 2 & Xj = X 2 
Feature Inheritance [Rule (5)] yields: 

{XA^X'AFi}, {X[\Y S }, {X 2 \F S2 ,X 2 \F 2 } 

h Xi:*i & Xi.ii =X[ & X'jii & X 2 :s 2 & X 2 .i 2 = X' 2 & X 2 : s 

& Xj = X 2 

Frame Allocation [Rule (0)] yields: 

{xA^.xAn}, {x[\y s }, {x 2 \y S2 ,x' 2 \y 2 }, {x 2 \y s } 

h Xi:sii & X u ti=X\ & X[:s & X 2 : s 2 & X 2 .£ 2 = X' 2 & X 2 : s 

& Xi = x 2 

Variable Elimination [Rule (3)] yields: 

{Xi\Y Sl ,X[\Yi}, {X[\Y S }, {Xi\Y S2 ,r 2 \Y 2 }, {X' 2 \Y S } 

h Xi:si & Xx.lxiXj & X*:* & Xi:s 2 & X.£ 2 =X 2 & X 2 : s 

& Xi = x 2 

Sort Intersection [Rule (1)] yields: 

{M^x^n} ,{x[\y s }, {xa^,x 2 \}- 2 } ,{x 2 \y s } 

h Xi:i 3 & Xili=X[ & X[:s & Xil 2 =X' 2 & X' 2 : s & X X =X 2 
Frame Merging [Rule (6)] yields: 
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{Xx\Y S3 ,X[\Y U X' 2 \Y 2 }, {X[\Y S }, {X' 2 \Y S } 

h X 1 :s 3 & Xil 1 =X[ & X[:s & Xil 2 =X 2 & X' 2 :s & = X 2 
Feature Inheritance [Rule (5)] yields: 

{X,\Y S3 , X[\Y 3 , X^Fi , X 2 \F 2 }, { 2CJ \Y S }, {X' 2 \Y S } 

h Xi:s 3 & JMi^-Xj & & X\:s & X x .lz=X' 1 & X£ : s 

& x x = x 2 

Sort Intersection [Rule (1)] yields: 

{x x \y S3 , r x \y 3 , W {x' 2 \y s } 

h Zi:i 3 & 11.^=1', & X[:s & X!.£2=^2 & ^2 : s & ^1=^2 
Frame Reduction [Rule (7)] yields: 

h Xi:s 3 & X 1 1 1 =X[ & X^s & X x .l 2 = X' 2 & X 2 :s & = X 2 
Feature Inheritance [Rule (5)] yields: 

{X l \Y S3 ,X' l \Y 3 ,X 2 \Y 3 ,X 2 \Y 2 }, {X[\Y s }, {X' 2 \Y s } 

h Xi:s 3 & X 1 1 1 =X[ & X[:s & Xi.l 2 =^ 2 & X 2 : s & X 2 : s 

& = X 2 

Sort Intersection [Rule (1)] yields: 

{M VA^, 4\^3 , X' 2 \Y 2 }, {X[\Y S }, {X' 2 \Y s } 

h Xi:s 3 & Xi-li^^ & X[:s & Xi.£ 2 =^ 2 & *2 :s & X!=X 2 
Frame Reduction [Rule (7)] yields: 

{Xi\Y S3 , X[\Y 3 , X' 2 \Y 3 }, {X[\Y S }, {X' 2 \Y S } 

h Xi:s 3 & Xili=X[ & X[:s & Xil 2 =X' 2 & X 2 :s & X x = X 2 
Theory Coreference [Rule (8)] yields: 
{M VAJ3}, {X[\Y S }, {X' 2 \Y S } 

h Xjlia & Xj.^^X; & X[:S & Zi.l 2 = X2 & X 2 :s & ^1=^2 

& X} = X' 2 
Variable Elimination [Rule (3)] yields: 
{MVAF3}, {X[\Y S }, {X[\Y S } 

h Xi:s 3 & Xj.^^x; & X[:s & Xi-fei^ & : s & = X 2 

& x;=x 2 

Sort Intersection [Rule (1)] yields: 
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{MWA^}, {X[\Y S }, {X[\Y S } 

h Xi-.ss & Xiii = x[ & x;:s & Xii 2 = x[ & Xj=x 2 & x;=x 2 

Frame Merging [Rule (6)] yields: 

{W*, ^\i3}, {x;\y s } 

h Xi:* 3 & Iiii=^i & X^s & Xi.l 2 =^ & Xi=X 2 & X{ = X' 2 
Theory Feature Closure [Rule (9)] yields: 

{M^, rAr 3 }, {x[\y s } 

h Xj:^ & X 1 .^=X'i & Xj:s & Z 1 .l 2 =X'i & X\.l = Z & Xj = X 2 

& x[ = x' 2 

Feature Inheritance [Rule (5)] yields: 
{X 1 \Y S3 ,r i \Y 3 ,Z\Y 4 }, {X[\Y S } 

h Xi:s 3 & Xili=X[ & X[:s & Zi.l 2 iX' 1 & X[.l = Z & Z : s 

& Xi = X 2 & Xj = X' 2 

Feature Inheritance [Rule (5)] yields: 

{MWA^zW, {X[\Y S ,Z\Y 5 } 

h Xi:* 3 & Xili=X[ & X[:s & Z 1 .I 2 =X' 1 & X\.l = Z & Z:s 

& Z : i & Xj = X 2 & X[=X' 2 

Frame Allocation [Rule (0)] yields: 

{XAWAi3,Z\*4}, {X[\Y S ,Z\Y 5 }, {Z\Y S } 

h X l :s 3 & Xi.li=X\ & X[:s & X 1 J 2 = X' 1 & Xji = Z & Z:.s 

& Z:s & Xi=X z & X[=X' 2 

Sort Intersection [Rule (1)] yields: 

{X 1 \Y S3 ,r i \Y 3 ,Z\Y 4 }, {X[\Y S ,Z\Y 5 }, {Z\Y S } 

h Xi:s 3 & ZiJi^Zj & X{:s & Xi.£ 2 = x; & X[.i = Z & Z:s 

& Xi = x 2 & x; = x' 2 

This is in (strong) ©-normal form, yielding the ^A-term (up to variable renaming): 
h = h A© h = *3^i X : s(l^ s),l 2 => X). 
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B OSF Formalism 

B.1 OSF Algebras 

An OSF Signature is given by (S, <, A, T) such that: 

• S is a set of sorts containing the sorts T and _L ; 

• < is a decidable partial order on S such that _L is the least and T is the greatest element; 

• {S, <) A) is a lower semi-lattice (s A s' is called the greatest common subsort of s and s'); 

• T is a set of feature symbols. 

Given an OSF signature {S, <, A, J 7 ), an OSF algebra is a structure 

such that: 

• D A is a non-empty set, called the domain of A, 

• for each sort symbol s in S, s A is a subset of the domain; in particular, T A = D A and 
± A = 0; 

• (sA s') -4 = s A n s'" 4 for two sorts s and s' in S; 

• for each feature I in J 7 , ^ is a total unary function from the domain into the domain; i.e., 

An OSF homomorphism 7 : i h B between two OSF algebras A and B is a function 
7 : D A i-> D B such that: 

• l[l A [d)) = ^ B (t(^)) for all d 6 D^; 

• j[s A ) Cs B . 

B.2 OSF Terms 

An OSF term t is an expression of the form: 

X : s[l\ t\,...,l n t n ) 

where X is a variable in V, s is a sort in S , l\, ... ,l n are features in T , n > 0, t\, . . . , t n are 
OSF terms, and where V is a countably infinite set of variables. 
Here is an example of an OSF term (call it t person ): 

X : person[name N : T [first F : string), 
name M : id[last S : string), 
spouse P : person[name / : id[last =>■ 5 : t), 
spouse =>• X : t)). 

We shall use a lighter notation, omitting variables that are not shared, and the sort of a 
variable when it is T : 

X : person[name T [first string), 

name id[last S : string), 
spouse =>• person[name =>- id[last =>- 5), 
spouse X)). 
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Given a term t = X : s(li =>- t\,...,l n => t n ), the variable X is called its root variable 
and sometimes referred to as Root(t). The set of all variables occurring in t is denned as 
Var(t) = {Root(t)} U |J" = l V&r(f f ). 

Given a term t as above, an OSF interpretation A, and an A-valuation a:VH I)- 4 , the 
denotation of ? is given by: 

M A ' a = {a(x)} n/n f] (4V(M Aa )- 

Ki<« 

Thus, for all possible valuations of the variables, p]] -4 = U a -Vi->i>' A M" 4 '" • 

A ip-term (or OSF term in normal form) is of the form = X : s(l\ =>- if>i,... ,l n =>- 
where: 

• there is at most one occurrence of a variable Y in ip such that Y is the root variable of a 
non-trivial OSF term (i.e., different than Y : T); 

• s is a non-bottom sort in S; 

• l\ ,...,£„ are pairwise distinct features in T , n > 0, 

• tpi , . . . , tp n are normal OSF terms. 
We call 1^ the set of all -terms. 

For example, the OSF term, 

X : person(name =>- id(first =>- string, 

last =>- 5 : string), 
spouse =>■ person(name =>■ id(last =>■ 5), 
spouse => X)~) 

is a normal OSF term and denotes the same set as t person . 

Definition 3 (OSF Term Subsumption) Lef V ^' ^ OSF terms. r/ie«, i> <i>' ("i> 
is subsumed by if>' ") if and only if, for all OSF algebras A, EV'l" 4 C HV'']]" 4 - 

Given a ^A-term the sort of a variable V G Var(i/;) will sometimes be referred to as 
Sort^^V). Given a variable V G Var(i/;), an occurrence path of V in if> is a string of features 
obtained by concatenating all the features from the root leading to an occurrence of V. We 
call Occ^,(v) the set of all the occurrence paths of V in if>. For example, if if> is the ^-term 
above, then Occ^X) = {e , spouse .spouse} and Occ^,[s) = {name .last , spouse .name .last} . 
The subscript tp will often be omitted for Sort and Occ when the context is clear. 

Here are a few facts about OSF terms. 

• OSF terms generalize first-order terms. First-order terms form a special OSF algebra where 
the sorts form a flat lattice and the features are (natural number) positions. Thus, the 
first-order term/^, . . . , t„), is just the ^-term: f{\ =>- t\, . . . ,n =>- t„). 

• All variables occurring in an OSF term are implicitly existentially quantified at the term 's 
outset (assuming no further outer context). As a corollary, sorts are particular (basic) OSF 
terms: indeed, IX : sJ A = s A since \J a :V^v A {{ a { x )} n 5 " A ) = 5 " A - 

• An OSF term i\> is the empty set in all interpretations if ip has an occurrence of a variable 
sorted by the empty sort _L. 

• Dually, EV'l" 4 = D A in all interpretations A if all its variables occur only once in ip and 
are sorted by T. 
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X : person & X. name = N & N : T & N . first == F & F : string 

& X. name = M & M : id & M. last = S & S : string 

& X. spouse = P & P : person & P . mme == / & I : id 

& I .last = S & S : T 

& P .spouse = X & X: T. 



Figure 5: OSF clause form of OSF term f peraon 

• Features are total functions. If i/j = X : =4> ^>„), and Z ^ Vhr(^),then 
H^J- 4 = p: : Vi> •• • >4 Vfc.* Z : "Oil" 4 for an y feature symbol I £ !F and 
any OSF interpretation A. 

• Variables denote essentially an equality among attribute compositions. For example, 
IX : t(*i 7 : T,l 2 => Y : t)^ a = {d E D A \ l A (d) = l A (d)}. This justifies our 
referring to variables as coreference tags. 

B.3 OSF Clauses 

A logical reading of an OSF term is immediate as its information content can be characterized 
by a simple formula. For this purpose, we need a simple clausal language as follows. 

An OSF constraint is one of (1) X : s, (2) X = X', or (3) X.l = X', where X and X' are 
variables in V, s is a sort in S, and I is a feature in T . An OSF clause is a set of OSF constraints 
(to be interpreted as their conjunction). 

Given A is an OSF algebra, an OSF clause (f> is satisfiable in A, A, a |= <j>, if there exists a 
valuation a:Vn V A such that, for every OSF constraint (/)' in (j>, A, a |= where: 

• A,a\= X : s if and only if a(x) 6 s" 4 ; 

• A a |= X = 7 if and only if a(x) = a(y); 

• i,a j=X7 = Y if and only if l A (a(x)) = a(y). 

B.4 From OSF Terms to OSF Clauses 

We can always associate with an OSF term if> = X : s(li =>- if>i,...,l n =>- if>„) a 
corresponding OSF clause as follows: 

= X:s& X.l\ =X[ &....&. X.l n = X' n 
&</>M &...&^») 

where X[ , . . . , X' n are the roots of tp\ , . . . , if> n , respectively. We say that is obtained from 
dissolving the OSF term if). For example, the non-normal OSF term tp erson of Section B.2 is 
dissolved into the OSF clause shown in Figure 5. It has been shown that the set-theoretic 
denotation of an OSF term and the logical semantics of its dissolved form coincide exactly [4] : 

Ul A = {a(x) | a E Val(A), A,a\= C%{x)} 

where C^[X] is shorthand for the formula X == Root(i/;) & </>{if>), and C^[X] abbreviates the 
formula 3 Var(ip) C^[X]. 
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X : person & X. name = N & N : id 



& N. first = F & F : string 
& N.last = S & S : string 
& X. spouse = P & P : person & P . name = / & I : id 

& / . Za5t = 5 
& P . spouse == X. 



Figure 6: Normal form of OSF clause of Figure 5 



To lighten notation, we shall confuse an OSF term for its dissolved form, writing if) when 
we actually mean </>(if>). 

B.5 OSF Unification 

Definition 4 (Solved OSF Constraints) An OSF clause <j> is called solved if for every 
variable X, (j> contains: 

• at most one sort constraint of the form X : s, with _L < s; and, 

• at most one feature constraint of the form X.l == X' for each I; 

• ifX = X' G (j>, then X does not appear anywhere else in (j>. 

Given an OSF clause (f>, non-deterministically applying any applicable rule among the four 
shown in Figure 1 until none apply will always terminate in a solved OSF clause. A rule 
transforms the numerator into the denominator. The expression </)[X/X'] stands for the formula 
obtained from (f> after replacing all occurrences of X' by X. We also refer to any clause of the 
form X : _L as the inconsistent clause. The following is immediate [4]. 

Theorem 7 (OSF Clause Normalization) The rules of Figure 1 are solution-preserving, 
finite terminating, and confluent (modulo variable renaming). Furthermore, they always result 
in a normal form that is either the inconsistent clause or an OSF clause in solved form. 

For example, the normalization of the OSF clause in the last example leads to the solved 
OSF clause which is the conjunction of the equality constraint M = N and the OSF clause 
shown in Figure 6. The rules of Figure 1 are all we need to perform the unification of two 
OSF terms. Namely, two terms t\ and ti are OSF unifiable if and only if the normal form of 
Root(ti) = Root(t 2 ) & t\ & t 2 is not _L. 

An OSF clause (f> in solved form is always satisfiable in the OSF graph algebra 1^ introduced 
next. As a consequence, the OSF normalization rules yield a decision procedure for the 
satisfiability of OSF clauses. 
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